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Abstract 

Background: The aim of this study was to evaluate the test-retest reliability and the validity of the self-reported 
questionnaire Olerud-Molander Ankle Score (OMAS) in subjects after an ankle fracture. 

Methods: When evaluating the test-retest reliability of the OMAS, 42 subjects surgically treated due to an ankle 
fracture participated 12 months after injury. OMAS was completed by the patients on two occasions at one to two 
weeks' interval. Concurrent criterion validity was evaluated using the five subscales of the Foot and Ankle Outcome 
Score (FAOS) and global self-rating function (GSRF), which is a five-grade Likert scale with the alternatives: "very 
good", "good", "fair", "poor", "very poor". Forty-six patients participated in the validation against FAOS, and for GSRF 
105 patients participated at 6 months and 99 at 12 months. Uni-, bi- and trimalleolar fractures were all included and 
both non-rigid and rigid surgical techniques were used. All fractures healed without complications. Before analysis 
of the results the five groups according to GSRF were reduced to three: "good", "fair" and "poor". Test-retest 
reliability was assessed using Spearman's rank correlation, the intraclass correlation coefficient (ICC), the standard 
error of measurement (SEM and SEM%) and the smallest real difference (SRD and SRD%). The Cronbach's alpha 
score and validity versus FAOS was assessed using Spearman's rank correlation and validity versus GSRF using the 
Kruskal-Wallis Test and the Mann-Whitney U-Test as ad hoc analyses. 

Results: The test-retest reliability correlation coefficient obtained was rho = 0.95 and ICC = 0.94. The SEM was 4.4 
points and SEM% 5.8% and should be interpreted as the smallest change that indicates a real change of clinical 
interest for a group of subjects. The SRD was 12 points and SRD% 15.8% and should be interpreted as the smallest 
change that indicates a real change of clinical interest for a single subject. The correlation coefficients versus the 
five subscales of FAOS ranged from rho = 0.80 to 0.86. There were significant differences between GSRF groups 
"good", "fair" and "poor" (p < 0.001) at both the six-month and the 12-month follow-up. The internal consistency for 
the OMAS was 0.76. The effect size between results from 6-month and 12-month follow-up turned out be 0.44 and 
should be considered as medium. 

Conclusion: The results showed that the test-retest reliability of the Swedish version of OMAS was very high in 
subjects after an ankle fracture and the standard error of measurement was low. Furthermore the OMAS was found 
to be valid using both the five subscales of FAOS and the GSRF. The OMAS can thus be used as an outcome 
measure after an ankle fracture. 
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Background 

Fractures involving the ankle are increasing [1] and are 
one of the most common fractures in the lower extremity 
[2,3] with an incidence rate of 101 fractures per 10 
person-years [3]. Ankle fractures occur in all ages and du- 
ring different types of daily activities [4,5]. The age- 
adjusted incidence rate for the two genders has been 
reported as equal [4], but in younger ages the incidence 
rate is higher among men and at the age of 50 the gender 
ratio reverses [2,4]. The number of ankle fractures in the 
elderly is increasing [1,6] and in women over 65 years of 
age it has been found to be 300 per 10 s person-years [7]. 
Most fractures are surgically treated with open reduction 
and internal fixation due to dislocation [8-12]. After sur- 
gery the ankle is normally immobilized in a below knee 
plaster cast [13-15] or in a brace [16-18] for six to eight 
weeks. 

Many investigators have evaluated both short- and 
long-term results after surgery [16,19-25]. Radiographic 
assessments [23,26], ankle mobility [16,22,26-29] and 
muscle strength [16,22,29,30] have been studied. It has 
however become more common to use patient-reported 
scores to evaluate functional results, as has been done 
by many authors [16,18,22,28,31]. As suggested by the 
International Classification of Functioning, Disability 
and Health (ICF), the degree of impairments, disabilities, 
participation problems and health related quality of life 
should be described from the patient's perspective [32]. 
Patient-reported instruments such as questionnaires are 
appropriate instruments for this purpose. 

Several scores have been developed in order to evalu- 
ate function after ankle injuries [26,33-35]. The Karlsson 
score [33] is a patient self-reported questionnaire while 
the Kaikkonnen score [34] has to be completed by both 
patient and clinician as six of the nine items consist of 
clinical tests. Both are mainly intended to evaluate func- 
tion after ankle ligament injuries and both have been 
found to be valid [36]. The Foot and Ankle Outcome 
Score (FAOS) is a self-reported questionnaire and was 
developed to assess function in a variety of foot and 
ankle-related problems (www.koos.nu). FAOS has been 
found valid and reliable [35]. 

The Olerud-Molander Ankle Score (OMAS) is a 
disease-specific questionnaire devised for patients with 
ankle fractures and has been frequently used to evaluate 
subjectively scored function in this patient group 
[16,19,22,23,26,29,31,37]. OMAS has been validated 
against: Linear Analogue Scale (LAS) assessing subject- 
ive evaluation of ankle function on a 15 cm long linear 
analogue scale with the ends marked "perfectly normal 
ankle" and "totally disabling ankle" (p < 0.01), range of 
motion in dorsal extension (p < 0.05), presence of osteo- 
arthritis grade II-IV (p < 0.001) and presence of disloca- 
tions on radiographs (p < 0.05) after an ankle fracture 



[26]. It has also been found to discriminate for subjec- 
tively experienced ankle instability (p < 0.02) and muscle 
strength in the ankle dorsi- and plantar flexors (p < 0.02) 
[22]. Wees et al. examined concurrent validity in pa- 
tients with acute ankle ligament injuries and compared 
the Ankle Function Score (AFS) and the OMAS. The 
concurrent validity between the two scores at baseline 
and at follow-up was found to be good (r p = 0.82 and 
0.70) [38]. Furthermore OMAS has been reported ca- 
pable of recording change over time in the short-term 
after an acute ankle ligament injury. The day seven effect 
size was reported to 1.3 and day 14 to 2.3 [36,39]. No 
floor or ceiling effects have been found when evaluating 
patients with acute ankle ligament injuries [38]. Al- 
though frequently used, few methodological studies re- 
garding OMAS have been performed [22,26,38,39] and 
to the best of our knowledge no studies have been pub- 
lished assessing the reliability of the OMAS instrument 
[36], effect size in the long-term or validity using an- 
other disease-specific questionnaire or self-rated func- 
tion with a graded rating scale in patients with surgically 
treated ankle fracture. The aim of this study was there- 
fore to evaluate the test-retest reliability and the validity 
of the self-reported questionnaire OMAS in subjects 
after an ankle fracture. 

Methods 

Design and participants 

This is a test-retest reliability study and a concurrent 
validity study of OMAS. All participants were from the 
same cohort and were recruited at the University Hos- 
pital in Lund, Sweden. This study is a part of an earlier 
intervention study [29] which was approved by the Re- 
search Ethics Committee at Lund University, Lund, 
Sweden. Written informed consent had to be provided 
for participation in the study. 

Test-retest reliability of OMAS 
Participants 

Forty-two subjects, 23 women and 19 men surgically 
treated due to an ankle fracture, participated in the 
evaluation of the test-retest reliability. The mean age 
was 42 (SD 14), 23 had a unimalleolar fracture and 19 
had a bi- or trimalleolar fracture. Both non-rigid and 
rigid surgical techniques had been used. Fracture types 
and surgical techniques that were used have been des- 
cribed in detail elsewhere [29]. All subjects had been 
immobilized non-weight-bearing in a below-knee plaster 
cast and the plaster time was mean 43 days (SD 5.5). All 
fractures healed without complications. 

Test procedure 

The test-retest reliability of OMAS was studied 12 months 
after injury. The questionnaire was completed on two sepa- 
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rate occasions. The first time OMAS was filled in at home 
by the 42 subjects and then sent by mail to the test leader 
(GN). The second time OMAS was filled in at the clinic 
when the subjects were called for a physical examination as 
part of a follow-up study [29]. The two sessions were 
spaced one to two weeks apart. The subjects did not have 
access to the first version when the second was filled in. 

Validity of OMAS using global self-rated function 
Participants 

One-hundred-five patients, 63 women and 42 men, par- 
ticipated in the validation of the OMAS at 6 months. 
The mean age was 44 (SD 14); 65 subjects had a 
unimalleolar fracture and 40 had a bi- or trimalleolar. At 
12 months, 99 patients, 59 women and 40 men partici- 
pated, the mean age was 45 (SD 14) and 64 had a 
unimalleolar and 35 bi- or trimalleolar fracture. Surgical 
techniques and immobilization routines were the same 
as described above. 

Test procedure 

OMAS and the global self-rated function (GSRF) were 
filled in at home six and twelve months after the injury 
and then sent by mail to the test leader (GN). 

Validity of OMAS using the Foot and Ankle Outcome 
Score 

Participants 

Forty-six patients, 26 women and 20 men, participated in 
the validation of OMAS versus the Foot and Ankle Out- 
come Score (FAOS). The mean age was 43 (SD 14) and 25 
had a unimalleolar fracture and 21 a bi- or trimalleolar. 
Surgical techniques and immobilization routines were the 
same as described above. 

Test procedure 

OMAS was filled in 12 months after the injury and sent 
by mail to the test leader. FAOS was filled in at the clinic 
when the subjects were called for a physical examination 
as part of a follow-up study [29]. 

Outcome measures 
OMAS 

OMAS is a self-administered patient questionnaire [26]. 
The scale is an ordinal rating scale from 0 points (totally 
impaired function) to 100 points (completely unimpaired 
function) and is based on nine different items given diffe- 
rent points: pain (0-25), stiffness (0-10), swelling (0-10), 
stair climbing (0-10), running (0-5), jumping (0-5), 
squatting (0-5), supports (0-10) and work/activity level 
(0-20). The different symptoms are stated and have differ- 
ent points according to the extent of disability the authors 
considered they would lead to [26]. The score is calculated 
as the sum of each rated item. The original version of the 



OMAS was created in Swedish and this version was the 
subject of the present study. Some minor changes were 
made in the version that was tested in the present study. 
In item three, dealing with swelling, the response alterna- 
tives in the original version were "none", "only evenings" 
and "constant". In the version that we tested the alter- 
natives were "none", "after overuse or only during the 
evening" and "constant". In item four, dealing with stair- 
climbing, the response alternatives in the original version 
were "no problems", "impaired" and "impossible". In the 
version that we tested the item was changed to "stairs" in- 
stead of "stair climbing" and the response alternatives were 
"no problems", "some problems" and "impossible". These 
changes were made with permission from the Swedish de- 
veloper, Professor Claes Olerud (16 March 2012). 

GSRF 

The GSRF is a self-administered ordinal five-grade rating 
scale. The patients have to evaluate their present ankle 
function using five alternatives: "very good", "good", 
"fair", "poor" and "very poor". 

Global self-rated ankle function can be assessed, for 
example, by LAS [17,22,26,33] or by a five-grade Likert 
scale (excellent, good, fair, poor, and very poor). 

FAOS 

FAOS is also a self-administered patient questionnaire 
and consists of 42 items divided into five subscales: pain 
(9 items), other symptoms (7 items), function in daily liv- 
ing (ADL) (17 items), function in sport and recreation and 
foot (4 items) and ankle-related quality of life (5 items). 
Standardized options are given and for each item a five- 
point Likert scale is used (no, mild, moderate, severe, ex- 
treme). Each item gets a score from 0-4 and each of the 
five subscale scores is calculated as the sum of the rated 
items included. Raw scores are then transformed to a scale 
0 (indicating extreme symptoms) to 100 (indicating no 
symptoms) (www.koos.nu). 

FAOS, developed from the self-reported questionnaire 
KOOS (Knee Injury and Osteoarthritis Outcome Score), 
has been found to be reliable over time in subjects with 
surgically treated ankle ligament injuries [35] and valid 
against three subscales of SF-36 (bodily pain, physical 
functioning and social functioning) (p < 0.01) in subjects 
with different foot and ankle disorders in the Turkish 
version of FAOS [40]. Furthermore, it has been found 
valid against the patient-reported instrument Achilles 
Tendon Total Rupture Score (ATRS) in subjects with a 
total achilles tendon rupture. All subscales of FAOS cor- 
related well with that instrument (p < 0.01) [41]. 

Statistics 

Statistical analyses were performed using the SPSS soft- 
ware version 17.0. As OMAS is an ordinal scale non- 
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parametric statistics have been used. However, to be able 
to analyze the standard error of measure (SEM), the 
smallest real difference (SRD) and effect size (ES) also 
the mean values and standard deviations a had to be ap- 
plied. All correlation coefficients (rho) were calculated 
using Spearman's rank correlation, with a coefficient 
level of < 0.5 considered as low, 0.5-0.69 as moderate, 
0.7-0.89 as high and 0.9-1.0 very high [42]. To analyze 
the agreement between the two repeated measurements 
at 12-month the intraclass correlation coefficient (ICC) 
was applied. To check for systematic error between the 
two measurements, the Wilcoxon's signed rank test was 
used. Internal consistency of OMAS was calculated 
using Cronbach's alpha [43]. The standard error of 
measure (SEM) was defined by SEM = SD V(l-ICC) and 
SEM% by (SEM/mean) x 100 where mean is the mean 
for all values from test session 1 and 2. The smallest real 
difference (SRD) was defined by SRD = 1.96 x SEM x V2 
and the SRD% by SRD/mean x 100 where mean is the 
mean for all values from test session 1 and 2. An 'error 
band' around the mean difference of the two measure- 
ments, d, was defined by 95% SRD = d± SRD [44]. Effect 
size of OMAS between six-month and twelve-month 
follow-up was calculated as (mean value of measurement 
2 - mean value of measurement 1)/SD of measurement 
1 [45]. Significance was considered at the alpha level of 
p < 0.05. Before the statistical analysis of the validity of 
OMAS versus the five-grade rating scale, the subjects 
were reduced to three groups. Those who had answered 
"very good" and "good" formed one group (Group 1), 
"fair" formed one group (Group 2) and those who had 
answered "poor" and "very poor" formed one group 
(Group 3). When comparing the results between the 
three groups the Kruskal Wallis test was used and as ad 
hoc between each group the Mann- Whitney U-test was 
applied. 

Results 

Test-retest reliability and internal consistency 

No significant differences were found between the two 
measurements of OMAS (p = 0.14) and the correlation 
(rho = 0.95) and agreement (ICC = 0.94) were both very 
high (Table 1). The standard error of measurement 
(SEM) which represents the smallest change that indi- 
cates a real (clinical) improvement or worsening for a 
group of subjects was 4.4 points and SEM% was 5.8%. 



The smallest real difference (SRD) which is the equiva- 
lence for a single subject was 12.0 points and the SRD% 
was found to be 15.8%. The 95% SRD which represents 
the limits for the smallest change for a single subject 
ranged from -10.4-13.7. The internal consistency of the 
9 items was 0.76 as calculated with Cronbach's alpha. 

Validity of OMAS versus GSRF 

There were significant differences in the scoring rates 
of OMAS between the three groups of GSRF at both the 
six-month (Table 2) and the 12-month follow-up 
(Table 3). 

Validity of OMAS versus FAOS 

The correlation between OMAS and the five subscales 
of FAOS was high. All correlation coefficients reached 
or exceeded 0.8 (Table 4). 

Score distribution and effect size 

OMAS varied from 0-100 at six months and 15-100 at 
12 months. One person scored 0 at six months. Six per- 
sons (5%) scored 100 at six months and 18 persons 
(15%) scored 100 at 12 months. Effect size turned out to 
be 0.44 calculated as (73.04-62.99)/23. 

Discussion 

The main results of this study showed that the test- 
retest reliability and concurrent validity of the OMAS 
were good for patients surgically treated due to an uni- 
bi or trimalleolar ankle fracture. 

Reliability is an important dimension of any patient- 
based outcome measure as it is essential to establish 
whether changes observed are due to the intervention 
and not to variations related to problems with the out- 
come instrument. The larger the random error, the lar- 
ger sample size is needed in order to obtain precise 
estimates of effects in a trial [45]. There are two aspects 
that have to be considered when evaluating reliability: 
reproducibility and internal consistency. A correlation 
coefficient in a test-retest of a measurement tool should 
be at least 0.70 when studying groups of patients and 
exceed 0.9 when studying individuals [45,46]. Inter- 
nal consistency can be evaluated using Cronbach's alpha 
(0-1) [43]. When items are used to form a scale they 
should have internal consistency, which means they 
should measure the same thing and be correlated to 



Table 1 Values of OMAS at first and second measurements and correlation between the two measurements 



Variable 


First measurement 


Second measurement 


p-value 


Correlation 


p-value 




n = 42 


n = 42 




correlation 


OMAS median (range) (1QR) 


78 (30-100) (26) 


80 (35-1 00) (30) 


p = 0.14n 


rho =0.95 


p < 0.001 


OMAS mean (SD) 


75 (19) 


77 (18) 




ICC = 0.94 


p < 0.001 



nWilcoxon sign rank test; rho= spearman correlation coefficient; IQR, Interquartile range; ICC, intraclass correlation coefficient. 
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Table 2 OMAS versus global self-rated function six months after injury 



Variable 


Group 1 


Group 2 


Group 3 


p-value 




good 


fair 


poor 






n = 44 


n = 36 


n = 25 




OMAS 6-month median (range) (IQR) 


80 (0-100) (19) 


60 (20-85) (20) 


40 (10-60) (25) 


p< 0.001n 



nKruskal-Wallis Test. 

Ad hoc analysis: Group 1 vs Group 2 = p < 0.001 (Mann-Whitney (/-test). 
Ad hoc analysis: Group 2 vs Group 3 = p < 0.001 (Mann-Whitney (/-test). 



each other. When comparing groups an alpha level of 
0.7-0.8 is recommended [43,45]. Too high levels of 
alpha indicate that all items are identical, addressing a 
rather narrow aspect of an attribute [43] or there is re- 
dundancy among items [45]. Too low levels indicate that 
the items included in the scale are not related to each 
other [43]. 

To the best of our knowledge no previous studies have 
evaluated the test- retest reliability of OMAS in subjects 
surgically treated due to an ankle fracture. Both the 
Spearman's rank correlation coefficient and the ICC bet- 
ween the two measurements turned out to be very high 
(rho =0.95 and ICC = 0.94), although the circumstances 
of the two occasions when the questionnaires were filled 
in were not exactly the same; the first version was filled 
in at home and the second at the clinic. Cronbach's 
alpha was within the recommended limits (0.76) and 
thus the OMAS can be regarded as a reliable measure in 
subjects after an ankle fracture. 

Even if an ICC value is high it does not mean that a 
test is appropriate for clinical use. Before recommending 
a test for that issue the measurement errors both for 
groups of subjects and for single individuals have to be 
analyzed as well. The measurement error of an instru- 
ment should be small and sensitive enough to detect real 
changes in scored function. In the present study the 
SEM and SRD were used which gave the measurement 
errors in absolute values. SEM was 4.4 points and SEM% 
was 5.8%. These figures are both low and should be 
regarded as indicative for true change beyond measure- 
ment error and applied as the smallest difference bet- 
ween two measurements when evaluating for example 
an intervention on group level. The 95% SRD ranged 
from -10-13.7 points and indicates that a real clinical 
change for a single subject should exceed this range. 
The SRD% is independent of the unit of measurement 



like SEM% and may be more easy to use in clinical prac- 
tice. The SRD% found was 15.8% and thus if a subject 
scores for example 60 points this subject must improve 
10 points to indicate a real change. From a clinical point 
of view these values seem to be reasonable and confirm 
that OMAS can be used to detect real changes in sub- 
jects after an ankle fracture. 

Criterion validity is the extent to which one measure 
is related to other measures or outcomes. This type of 
validity can be divided into either concurrent or predic- 
tive validity; concurrent validity is when a new tool is to 
be compared at the same time with another measure- 
ment as gold standard [47]. Often, however, a true gold 
standard against which a new measurement can be com- 
pared does not exist, and this was the case in our study. 
In the present study OMAS was validated using two dif- 
ferent instruments, the disease-specific questionnaire 
FAOS and the global rating scale, GSRF. FAOS has been 
tested for reliability and validity in the Swedish version 
[35], the Turkish version [40] and the Iranian version 
[48]. The ICC values for test-retest reliability were high 
[40,48] and the validity using SF-36 varied between low 
and moderate in the Iranian version [48] and between 
low and high in the Turkish version [40], but in that 
study only three subscales of SF-36 were presented. 
SF-36 is an instrument evaluating generic health-related 
quality of life, including equal parts of mental health and 
physical health, whereas FAOS is a disease-specific in- 
strument evaluating functional outcome of the ankle. 
Thus not all subscales of SF-36 might be expected to 
correlate well with FAOS. 

We found that the correlations between OMAS and 
the five subscales of FAOS were all high. When looking 
at the figures it can be noticed that for the subscale ADL 
in FAOS the median value was 99 and the interquartile 
range was nine. These figures are more extreme than 



Table 3 OMAS versus global self-rated function 12 months after injury 



Variable 


Group 1 


Group 2 


Group 3 


p-value 




good 


fair 


poor 






n = 58 


n = 26 


n = 12 




OMAS 12-month median (range) (IQR) 


85 (50-100) (25) 


60 (30-75) (25) 


45 (15-75) (19) 


< 0.001n 



nKruskal-Wallis Test. 

Ad hoc analysis: Group 1 vs Group 2 = p < 0.001 (Mann-Whitney tMest). 
Ad hoc analysis: Group 2 vs Group 3 = p = 0.021 (Mann-Whitney tMest). 
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Table 4 Validity of OMAS versus the five subscales of the foot and ankle outcome score 


OMAS median (range) (IQR) 


FAOS subscales median (range) (IQR) 


Spearman's 


p-value 


n = 45 


n = 45 


correlation 


correlation 


coefficient 








(rho) 




80 (30-100) (25) 


Pain 92 (47-100) (22) 


0.86 


<0.001 


80 (30-100) (25) 


Symptoms 79 (30-100) (37) 


0.81 


<0.001 


80 (30-100) (25) 


ADL 99 (54-100) (9) 


0.80 


<0.001 


80 (30-100) (25) 


Sports 80 (0-100) (45) 


0.85 


<0.001 


80 (30-100) (25) 


QoL 75 (25-100) (26) 


0.81 


<0.001 



IQR, Interquartile range. 



they are for the rest of the items, and it seems as if the 
suggested functions were minor problems for the pa- 
tients included in our study. FAOS has been developed 
from the self-reported questionnaire KOOS aimed at 
evaluating function in subjects with osteoarthritis in the 
knee, and the questions are adjusted and relevant to 
those patients. We think that not all items might be suit- 
able for patients after an ankle fracture especially, not in 
the ADL subscale where many items deal with problems 
in non-weight-bearing positions such as "bending to 
floor", "putting on socks", "taking off socks", "lying in 
bed". It is logical to believe that if patients do not iden- 
tify with the problems presented, the total score of those 
items would be rated higher. In the validation study of 
FAOS the authors came to the same conclusion; only 
two of 17 items were considered as "at least of some im- 
portance" by the responders [35]. Regarding the subscale 
pain, the correlation was high but again the numerical 
value for FAOS was higher, and again many of the items 
of that subscale of FAOS deal with pain in non-weight 
-bearing positions such as "at night while in bed", "sit- 
ting or lying", "bending foot/ankle fully", "stretching 
foot/ankle fully". These situations are probably a minor 
problem in the studied group with ankle fractures. 

Global rating of function was the traditional way of 
presenting results after treatment of injuries or diseases 
in earlier studies [8,11,49]. As a clinician it is important 
to take into account the patient's overall opinion about 
recovery and function after an injury. It should be opti- 
mal if the returned responses from a disease-specific in- 
strument agree with the patient's overall rated function 
from the same region. OMAS has been shown to corre- 
late well in that respect using LAS [26]. In the present 
study the evaluation using GSRF agreed well with results 
from OMAS. The points from the three groups "good", 
"fair" and "poor" were significantly separated at both the 
six-month and the 12-month follow-up. 

Responsiveness is one dimension of great importance 
to determine when evaluating the methodological quality 
of a measurement. The responsiveness of an instrument 
expresses its capability to detect changes over time or 
changes due to intervention. In the present study the 



responsiveness was evaluated by calculating the effect size 
and was found to be 0.44 which could be considered as 
medium [45]. In the study by Rose et al. the effect size 
was evaluated in the early stages after an acute ankle liga- 
ment injury and was found to range from 1.3 to 2.3 [39]. 
It is well known that ankle ligament injuries recover 
quickly during the first two weeks, thereby the figures bet- 
ween that group and the group that we have focused on 
should differ. Between six and 12 months after an ankle 
fracture the improvement can be expected to progress 
more slowly and by then the effect size found should be 
regarded as realistic. There is a lack of knowledge how the 
responsiveness of OMAS is in the early phases of the re- 
habilitation process after ankle fractures and further stud- 
ies regarding this are required. 

However, to be able to detect changes over time the 
floor and ceiling effects should be considered as well, 
which means the number of patients reporting the low- 
est score at baseline should be limited in order to be 
able to observe deterioration. In the same way ceiling ef- 
fect occurs when a patient reports excellent function 
and receives the best possible score. Questionnaires with 
good validity are expected to have fewer categories with 
floor or ceiling effects and no more than 15% of the in- 
dividuals should score on these levels [40]. In the 
present study no floor or ceiling effects regarding 
OMAS were found at six months. At 12 months 15% 
scored 100, which is within the recommended limits, 
and one year after injury it can be expected that some of 
the patients have attained normal function. 

OMAS is well known and has been used in lots of 
studies for several years [16,17,19,22,23,26,28,29,37,50]. 
With relatively few items it can be easily completed and 
the items included are all relevant to normal activities of 
daily life. The score is simple for the researcher to use as 
the raw score is summed up without any further calcula- 
tion. The different symptoms are given and have diffe- 
rent points according to the extent of disability the 
authors considered they would lead to [26]. It seems as 
if these differences are reasonable, as OMAS and self- 
reported global function using both LAS [26] and GSRF 
in the present study have shown good correlations. Pain 
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during different weight-bearing situations is the item 
that is highest weighted. It seems relevant as the foot 
and ankle take weight during every step all day long and 
pain in these situations should be strongly disabling. In 
two of our earlier studies we found that one year after 
the injury more than half of the patients still experienced 
pain while walking [22,31]. 

Ankle fractures surgically treated normally need sta- 
bilization of the ankle mortise using either staples or a 
screw [8,9,11] and immobilization for six to eight weeks 
after surgery in a plaster cast or in an orthosis [51]. These 
treatments influence the mobility of the ankle. The sur- 
gery technique might diminish the flexibility in the mor- 
tise and perhaps also the range of motion in dorsiflexion 
in the ankle joint. Maximum of dorsiflexion is needed du- 
ring many activities of daily living such as climbing on a 
stool, rising from a chair, walking downstairs, walking up- 
hill, rising from floor etc. Problems in those situations re- 
mind the patient of the injury and would thus be reflected 
in the score. Many authors have reported that patients still 
complained about stiffness in the ankle one year after the 
injury [16,17,22,31]. 

Swelling has been frequently reported in the studied 
patient group [15,17,22,31]. Swelling probably also af- 
fects the experience of stiffness and pain. Both stiffness 
and swelling are graded relatively high in the scoring 
scales, which seems relevant. Running and jumping are 
weighted lower. Ankle fracture occurs in all ages but in- 
creases with higher age [1,6], particularly in women [7]. 
Most ankle fractures happen to persons when stumbling 
or slipping in everyday life rather than to athletes [4]. It 
is thus likely that these functions have less influence in 
the main group of injured persons, which might be the 
reason why these items were weighted lower by the au- 
thors [26]. However, in younger age groups these func- 
tions are probably more important to regain, and then 
the weighting could be discussed. Furthermore, impedi- 
ments or inabilities in return to work or to earlier acti- 
vities of daily life might have a great impact on a 
person's life. This item is weighted high, which seems to 
be correct. 

Despite having relatively few items, the OMAS in- 
cludes all dimensions of ICF recommended by the 
WHO [32]. Pain, stiffness and swelling belong to the do- 
main Body function; stair-climbing, jumping, running 
and squatting to the domain Activity; and work/activities 
of daily life to the domain Participation. OMAS thus 
also fulfills these demands. 

Conclusion 

In conclusion, the results of this study showed that the 
test-retest reliability of the Swedish version of OMAS 
was very high in subjects surgically treated after an ankle 
fracture and the standard error of measure was low. 



Furthermore, the concurrent validity using FAOS and 
GSRF was high. OMAS can thus be used as an outcome 
measure after an ankle fracture. 
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